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CLAIMS 

What is claimed is: 

1. In a server, a method for providing information suitable for audio output, the 
method comprising: 

5 receiving a first set of information over a network based on a request for 

the first set of information; 

accessing a tagged document in response to receiving the first set of 
information; and 

generating a second set of information suitable for audio output based on 
10 the first set of information and the tagged document. 

2. The method of claim 1, wherein: 

the step of receiving the first set of informafion comprises receiving a web 
page based on a Uniform Resource Locator (URL) request for the web page; 
15 the step of accessing the tagged document comprises accessing an 

Extensible Markup Language (XML) document; and 

the step of generating the second set of information comprises generating 
filtered web content suitable for audio output based on the web page and the 
XML document. 
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The method of claim 1 , wherein the step of receiving the first set of information 
based on a request for the first set of information comprises the steps of: 

receiving speech information specifying the first set of information; 

generating a text request for the first set of information based on an 
acoustic speech recognition (ASR) technique appHed to the speech information, 
and 

submitting the text request over the network. 

The method of claim 3, wherein the step of generating the text request comprises 
interpreting at least one primitive construct based on the speech information. 

The method of claim 4, further comprising the step of generating at least one 
additional primitive construct based on a request for a user-defined command. 

The method of claim 3, wherein the step of generating the text request comprises 
applying a case-logic technique to the speech information. 

The method of claim 1 , wherein the step of accessing the tagged document in 

response to receiving the first set of information comprises: 

determining an identity of the request for the first set of information, and 
accessing the tagged document based on the identity of the request. 
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The method of claim 7, wherein the identity of the request is based on at least one 
of an identifier for an originator of the request and an identifier for a destination of 
the request. 

The method of claim 1 , wherein the step of generating the second set of 
information suitable for audio output comprises: 

selecting at least one portion of the first set of information that is suitable 
for audio output; and 

generating the second set of information based on selecting the at least one 
portion of the first set of information. 

The method of claim 1 , wherein the step of generating the second set of 
information suitable for audio output comprises: 

generating text data suitable for audio output based on the first set of 
information and the tagged document, and 

generating audio data based on the text data. 

The method of claim 10, wherein the step of generating the text data suitable for 
audio output comprises generating at least one response and the step of generating 
the audio data based on the text data comprises applying a text-to-speech (TTS) 
technique to the at least one response. 

The method of claim 1 , wherein the step of accessing the tagged document is 
performed based on the request for the first set of information and approximately 
concurrently with the step of receiving the first set of information. 
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The method of claim 1 , wherein each of the first set of information, the tagged 
document, and the second set of information is at least one of a Hypertext Markup 
Language (HTML) page, an Extensible Markup Language (XML) page, a Virtual 
Reality Modeling Language (VRML) page, and a Standard Generic Markup 
Language (SGML) page. 

A system for providing information suitable for audio output, the system 
comprising: 

a document database configured for storing a plurality of tagged 
documents; and 

a server comprising an executable resource, wherein the executable 
resource 

receives a first set of information over a network based on a 
request for the first set of information; 

accesses a tagged document from the document database based on 
receiving the first set of information; and 

generates the second set of information suitable for audio output 
based on the first set of information and the tagged document. 

The system of claim 14, wherein the first set of information is a web page based 
on a Uniform Resource Locator (URL) request for the web page; the tagged 
document is an Extensible Markup Language (XML) document; and the second 
set of information is filtered web content suitable for audio output based on the 
web page and the XML document. 



CISOO-2410 

16. 



5 

17. 
18. 

10 

19. 

15 20 



-37- 

The system of claim 14, wherein the executable resource receives speech 
information specifying the first set of information, generates a text request for the 
first set of information based on an acoustic speech recognition (ASR) technique 
applied to the speech information, and submits the text request over the network. 



The system of claim 16, wherein the executable resource interprets at least one 
primitive construct based on the speech information. 



The system of claim 17, wherein the executable resource generates at least one 
additional primitive construct based on a request for a user-defined command. 



The system of claim 18, wherein the executable resource applies a case-logic 
technique to the speech information to generate the text request. 



The system of claim 14, wherein the executable resource determines an identity of 
the request for the first set of information, and accesses the tagged document 
based on the identity of the request. 
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The system of claim 20, wherein the identity of the request is based on at least one 
of an identifier for an originator of the request and an identifier for a destination of 
the request. 
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The system of claim 14, wherein the executable resource selects at least one 
portion of the first set of information that is suitable for audio output, and 
generates the second set of information based on selecting the at least one portion 
of the first set of information. 

The system of claim 14, wherein the executable resource generates text data 
suitable for audio output based on the first set of information and the tagged 
document, and the executable resource generates audio data based on the text 
data. 

The system of claim 23, wherein the text data comprises at least one response, and 
the executable resource applies a text-to-speech (TTS) technique to the at least 
one response to generate the audio data. 

The system of claim 14, wherein the executable resource, in an approximately 
concurrent time frame: 

accesses the tagged document based on the request for the first set of 
information, 

and receives the first set of information. 

The system of claim 14, wherein each of the first set of information, the tagged 
document, and the second set of information is at least one of a Hypertext Markup 
Language (HTML) page, an Extensible Markup Language (XML) page, a Virtual 
Reality Modeling Language (VRML) page, and a Standard Generic Markup 
Language (SGML) page. 
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A computer program product that includes a computer readable medium having 
instructions stored thereon for providing information suitable for audio output, 
such that the instructions, when carried out by a computer, cause the computer to 
perform the steps of: 

receiving a first set of information over a network based on a request for 
the first set of information; 

accessing a tagged document in response to receiving the first set of 
information; and 

generating a second set of information suitable for audio output based on 
the first set of information and the tagged document. 

The computer program product of claim 27, wherein: 

the step of receiving the first set of information comprises receiving a web 
page based on a Uniform Resource Locator (URL) request for the web page; 

the step of accessing the tagged document comprises accessing an 
Extensible Markup Language (XML) document; and 

the step of generating the second set of information comprises generating 
filtering web content suitable for audio output based on the web page and the 
XML document. 



A system for providing information suitable for audio output, the system 
comprising: 

a document database configured for storing a plurality of tagged document 

pages; 

means for producing a second set of information suitable for audio output, 

wherein the n/fcducing means 

receives a first set of information over a network based on a 
request for the first set of information; 

accesses a tagged document from the document database based on 
receiving the first set of information; and 

generates the second set of information suitable for audio output 
based on the first set of information and the tagged document. 

The system of claim 3 1 , wherein the first set of information is a web page based 
on a Uniform Resource Locator (URL) request for the web page; the tagged 
document is an Extensible Markup Language (XML) document; and the second 
set of information is filtered web content suitable for audio output based on the 
web page and the XML document. 



A computer program propagated signal product embodied in a propagated 
medium, having instructions for providing information suitable for audio output, 
such that the instructions, when carried out by a computer, cause the computer to 
perform the steps of: 

receiving a first set of information over a network based on a request for 
the first set of information; 

accessing a tagged document in response to receiving the first set of 
information; and 

generating a second set of information suitable for audio output based on 
the first set of information and the tagged document. 

The computer program propagated signal product of claim 29, wherein: 

the step of receiving the first set of information comprises receiving a web 

page based on a Uniform Resource Locator (URL) request for the web page; 
the step of accessing the tagged document comprises accessing an 

Extensible Markup Language (XML) document; and 

the step of generating the second set of information comprises generating 

filtered web content suitable for audio output based on the web page and the XML 

document. 
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A method for navigating a web by voice in a server configured for executing voice 
web applications, the method comprising: 

requesting a web page based on a voice web navigation request; 

receiving a retrieved web page based on the voice web navigation request; 

accessing an extensible markup language page in response to receiving 
the retrieved web page; 

generating filtered web content suitable for audio output based on the 
retrieved web page and the extensible markup language page; and 

generating the at least one audio output file based on the filtered web 
content , 

The method of claim 33, wherein the step of requesting the web page based on the 
voice web navigation request comprises the steps of: 

receiving speech information specifying the web page; 

generating a text request for the web page based on an acoustic speech 
recognition (ASR) technique applied to the speech information, and 

submitting the text request over the network. 



The method of claim 33, wherein the step of accessing the extensible markup 
language document in response to receiving the retrieved web page comprises: 
determining an identity of the voice web navigation request for the web 
page, and 

accessing the extensible markup language page based on the identity of the 
voice web navigation request. 

The method of claim 35, wherein the identity of the request is based on at least 
one of an identifier for an originator of the voice web navigation request and an 
identifier for a destination of the voice web navigation request. 
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37. The method of claim 33, wherein the step of generating the fiUered web content 
suitable for audio output comprises: 

generating text data suitable for audio output based on the retrieved web 
5 page and the extensible markup language document, and 

generating audio data based on the text data. 



